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Abstract 


'A  geometric  programming  model  is  proposed  to  determine  the 
optimal  design  of  the  CPU  and  its  matching  storage  hierarchy.  The 
objective  function  is  the  maximization  of  system  reliability  subject 
to  performance  and  budgetary  limitations.  Examples  illustrating  the 
use  of  the  model  are  presented. 
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1.  Introduction 


An  important  area  of  concern  during  the  design  of  a  computer 
system  is  the  design  of  its  storage  subsystem.  Several  optimization 
models  for  the  design  of  storage  hierarchies  are  available  [3,4,8,12]. 
These  models  typically  optimize  system  performance,  measured  by 
throughput  or  average  hierarchy  access  time,  subject  to  budgetary  lim¬ 
itations.  In  the  previous  efforts,  no  consideration  is  given  to  the 
reliability  issues.  In  systems  designed  for  avionics,  space,  and  cer¬ 
tain  military  applications,  reliability  is  of  utmost  concern.  In  this 
paper,  we  develop  an  optimization  model  for  linear  storage  hierarchies 
with  the  objective  of  maximizing  system  reliability  subject  to  a  cost 
and  a  performance  constraint.  The  capacities  of  various  memory  levels 
are  the  decision  variables.  An  extended  model  also  includes  the  CPU 
speed  as  a  decision  variable. 

Chow  [3,4],  considered  the  design  of  a  linear  storage  hierar¬ 
chy  with  the  objective  of  minimizing  the  average  access  time  subject 
to  a  cost  constraint.  The  present  paper  is  an  extension  of  Chow's 
model  to  include  reliability  considerations.  Other  related  work  is  by 
MacDonald  and  Sigworth  [8] ,  and  Trivedi  and  Sigmon  [12] . 
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2 .  Model  Development 

The  storage  hierarchy  consists  of  n  levels,  M^,  M2»  Mn» 
as  shown  in  Figure  1.  The  hierarchy  management  follows  the  staging 
rules  proposed  by  Gecsei  [6].  The  execution  is  out  of  the  fastest 
level  M,  which  may  be  thought  of  ns  a  cache.  Information  transfers 
are  between  adjacent  levels  only.  By  convention,  the  higher  the  level 
(smaller  the  index)  in  the  hierarchy,  the  smaller  its  capacity.  Rules 
of  operation  of  such  a  hierarchy  are  described  in  [6,10]. 

The  capacity  of  memory  level  Nh  ,  will  be  denoted  by  (bits), 
and  the  block  size  by  s^  (bits).  Since  the  entire  address  space  of 
the  program  must  be  contained  in  level  n,  we  assume  that  the  capacity 
of  that  level,  Kn,  is  a  fixed  input  parameter  to  the  model.  The 
decision  variables  are  the  capacities  K2,  ...,  Kn_i* 

The  behavior  of  the  program  can  be  characterized  by  a 
reference  string.  We  assume,  however,  that  the  program  behavior  and 
the  effect  of  storage  management  strategy  are  compacted  into  a  single 
function  called  the  success  function  which  gives  the  probability  that 
a  storage  reference  from  the  CPU  is  found  in  a  given  level.  The  suc¬ 
cess  function  H^,  for  level  i  depends  on  the  capacity  of  level  i  (K^), 
the  block  size  of  level  i  and  all  of  the  lower  levels 
(si,  s^+^,  ...,  sn) ,  and  the  block  replacement  algorithm.  If  the 
hierarchy  management  rules  satisfy  certain  reasonable  properties  [6], 
it  can  be  shown  that  the  success  function,  H ^ ,  depends  only  upon  the 
capacity  and  the  block  size  s^.  That  is. 

Hi  =  H^(K^;s^)  1  =  1,.:,... ,n  • 
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The  semicolon  is  used  above  to  emphasize  the  fact  that  is  a 
decision  variable  while  s^  is  a  fixed  parameter.  The  miss  ratio  func¬ 
tion  is  defined  to  be 


F.  =  1  -  H. 
l  l 

Let  t^  be  the  average  time  to  transfer  a  block  of  size  sj_^ 
from  the  ifc^  level  to  level  i-1.  The  access  times  t^,  .  ..,  tR  are 
considered  fixed  parameters  in  our  model.  Due  to  the  linear  organiza¬ 
tion,  the  time  to  fetch  a  block  from  level  i  and  percolate  it  (or 
appropriate  sub-blocks)  up  to  level  1,  denoted  by  T^,  is  given  by 


The  average  access  time,  T,  of  the  entire  hierarchy  is  the 
weighted  average  of  the  access  time  to  various  levels,  that  is, 
n 

T  =  Z  h.T. 
i-1  1  1 

The  weight  h^  is  the  probability  of  a  hit  to  level  i  and  misses  to  all 
the  previous  levels  j  <  i.  Thus 


h.  -  H.  -  H.  ,  -  F.  .  -  F. 

l  l  l-l  l-l  l 


1—1,2, . . .n 


and 


F0  =  1  '  Fn  "  0 


Therefore,  the  expression  for  T  can 


T  -  fci  + 

1  =  2 
n 

=  ti  +  .ViFi-i(Ki-i;si-i> 
1  =  2 


by  convention, 
be  simplified  to 
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The  cost  of  the  ic  memory  level  is  modeled  by  a  posynomial 
function  of  its  capacity,  that  is 


DEVCOST.  =  ?  c. -K . 13 

l  j  l]  1 


ci  j  2.  0  '  i  =  1  /2, . . .  ,n 


The  total  svstem  cost  is  then  given  by  Z  DEVCOST.. 

i=l  1 

The  failure  rate,  X^»  of  memory  level  i  is  a  posynomial  func¬ 
tion  of  its  capacity,  that  is 

B.  . 

Xi  =  Z  a ±  j Kf±  1 11  ,  d^  >  0  ,  i  =  l ,  2 , .  .  . n  . 

Note  that  the  military  standard  M1L-217B  [9]  suggests  that  X^  is  a 
linear  function  of  capacity  K^.  A  failure  in  any  level  is  assumed  to 
cause  the  entire  hierarchy  to  fail.  Thus,  from  the  reliability  point 
of  view,  it  is  a  series  system.  If  we  further  assume  that  the  failure 
rate  of  each  level  is  independent  of  its  age  (or  equivalently,  the 
lifetimes  are  exponentially  distributed),  the  hierarchy  failure  rate, 
X,  is  also  time-independent,  and  is  given  by  [1], 
n 

X  =  I  Xj  . 

i=l  1 

For  such  a  system,  minimizing  the  failure  rate  is  equivalent  to  maxim¬ 
izing  the  reliability  [1]. 

We  are  now  ready  to  define  the  optimization  problem  for  the 
design  of  the  storage  hierarchy.  Assume  that  the  hierarchy  access 
time  is  not  to  exceed  Tn  and  the  system  cost  is  constrained  by  BUDGET. 


min  X  =  z  2  d  —  K, ' 
i=l  j  13  1 


such  that 
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T  =  fci  +  i  To 

n  ^ii 

Z  Z  C ■ -K.  J  <  BUDGET 

i=l  j  13  1 


Ki  >  0  ,  i=l, 2, . . . ,n-l 


(lb) 

(lc) 

(Id) 


If  we  assume  that  the  miss  ratio  function,  F^,  is  a  posynomial 

function  of  capacity  K.  (that  is, 

~dii  1 

Fi  =  “  aiiKi  J  '  aii— ^  '  i=l » 2 , . . . ,n-l) ,  then  the  design  problem 

j  3  3 

(1)  above  is  a  standard  geometric  programming  problem.  From  the 
theory  of  geometric  programming  [13] ,  we  get  the  following  result: 


Theorem  _1: 

Any  relative  minimum  of  the  design  problem  (1)  is  also  its  global 
minimum. 


The  implication  of  this  result  is  that  any  standard  technique 
for  nonlinear  optimization  will  locate  the  global  minimum  of  the 
design  problem.  Alternatively,  geometric  programming  techniques  can 
be  used  to  solve  the  problem.  In  the  next  section,  we  consider  a  sim¬ 
plification  of  this  problem  so  that  a  closed  form  solution  can  be 
obtained . 


3.  Simplified  Design  Problem 

In  the  previous  section,  the  miss  ratio  function,  the  device 
cost  function,  and  the  failure  rate  function  were  assumed  to  be  quite 
general  posynomial  functions.  In  this  section  we  make  the  following 
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simplifying  assumptions  (that  is,  restrict  the  number  of  terms  in  the 
posynomials) : 


F  ^  —  a^Kj,  t  i-1 , 2 , . • • ,n  1 


.Pi 


^i  _  doi  +  dliKi  '  x=l,2,. ..,n 


A 


DEVCOST.  =  c_.  +  C,.K/  ,  i  =  l ,  2 , 
l  Oi  li  l 


/  n 


(one  term  only) 
(two  terms  only) 
(two  terms  only) 


Note  that,  since  K  is  fixed,  X  and  DEVCOST  are  fixed.  Define 

n  n  n 

n  3n 

=  Z  dn.  +  d1  Krn  , 

0  iZ1  Oi  In  n 


°0  ■  J^Oi  +  clnKn"  • 

Now  the  design  problem  (1)  can  be  rewritten  as 

n-l  p. 

min.  d  +  Z  d,  ,K* 

0  i=1  li  i 


(2a) 


n 


-*1-1 


1  _2  i  i-i  i-i  -  "0 


n-l 


<  BUDGET  -  Cg 

K.  >  0  , 

i ""  1  f  2  ^  •  •  • ,  n-l 

(2b) 

(2c) 

(2d) 


With  these  assumptions,  it  can  be  shown  that  [11]  the  solution 
to  design  problem  (1)  can  be  decomposed  into  two  steps.  The  first 
step  is  to  obtain  the  solution  to  the  following  subproblem: 


n-l 


min.  dg  +  £  dliKi 

i=l 


Pi 


(3a) 


n  ~*i-l 

fci  +  . ;2tiai-iKi-i  <  To 


(3b) 
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K  ^  >  0  ,  i-1, 2, . . . ,n-l  (3c) 

In  particular,  we  have  remover]  the  cost  costraint.  Assume  that  a 
solution  to  problem  (3)  is  given  by  ,  J<2»  ...»  K  Now  ^ 

n-i  y. 

i=l  11  1  -  ° 

then  this  is  also  the  solution  to  the  overall  design  problem  (2).  On 
the  other  hand,  if 
n-1  Yt 

i  C, -K . 1  >  BUDGET  -  cn 
i=l  il  1  U 

then  the  original  design  problem  (2)  possesses  no  feasible  solution. 
In  other  words,  the  requirements  of  the  design  problem  are  incompati¬ 
ble.  Due  to  this  result,  we  will,  henceforth,  omit  the  cost  con¬ 
straint  from  the  problem  specification.  We  will  assume  that  the 
requirements  are  compatible. 


With  the  further  restriction  that  c(^  =  c(  and  p^  =  p  for  all  i, 
the  following  closed  form  solution  to  design  problem  (3)  can  be 
obtained  [ 11] . 


As  an  example,  we  take  the  input  parameters  shown  in  Table  1 
for  the  design  of  a  3-level  storage  hierarchy.  The  data  on  the  miss 
ratio  function  is  derived  from  [8).  All  other  input  parameters  are 
based  on  the  data  reported  in  [2,5].  The  resulting  optimal  capacities 
are  also  shown  in  Table  1. 
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Next  we  consider  the  problem  with  the  same  parameters,  except 
that  the  maximum  allowable  hierarchy  access  time,  is  to  be  varied. 
The  result  of  such  a  sensitivity  analysis  is  shown  in  Figure  2.  We 
have  plotted  the  optimal  hierarchy  MTTF  (equal  to  the  reciprocal  of 
the  system  failure  rate)  as  a  function  of  the  maximum  allowable 
hierarchy  access  time  T^.  In  Figure  3,  we  have  plotted  the  cost  of 
the  optimally  designed  hierarchy  as  a  function  of  . 

4 .  Including  the  Selection  of  CPU  Speed 

The  model  of  linear  storage  hierarchy  discussed  up  to  this 
point  does  not  address  the  selection  of  CPU  speed.  We  have  also 
ignored  the  effect  of  CPU  instruction  execution  delay  from  performance 
considerations.  Let  tg  be  the  average  instruction  interpretation  and 
execution  time  (i.e.,  l/t0  is  the  instruction  execution  rate  assuming 
an  infinitely  fast  memory  hierarchy) .  Let  A  be  the  average  number  of 
memory  references  per  instruction.  Gecsei  and  Lukes  [7]  estimate  that 
A  =  2  while  Snow  and  Siewiorek  [2]  estimate  that  A  =  1.162. 


The  average  execution  time  per  instruction  is  now  given  by 

to  +  MtI  +  if2tiF(Ki-l>  1 


Apart  from  the  memory  capacities  , • • . »Kn_ i •  we  consider 
tg  also  to  be  chosen  by  optimization.  We  assume  that  tg  is  a  power 
function  of  the  complexity,  Gg,  of  the  CPU: 


9 


I 


Gq  may  be  equated  to  the  gate  count,  or  any  related  measure  of  com¬ 
plexity.  The  CPU  failure  rate,  is  also  a  power  function  of  the 
complexity: 


^0  d00  +  d10G0 


Po 


Finally,  the  CPU  cost  is  modeled  by 


c00  +  C10G0 


The  design  problem  (1)  can  be  redefined  as  follows: 


n_1  Pi  Po 

min.  d  +  2  d  K  +  d1()G 

i  =  l 


(5a) 


s-t-  B10G0  0  +  AItl  +  i  TSI  -  Boo  (5b> 


y  n  y 

and  coo  +  C10G0°  +  ^<'01  +  ciiKii)  i  BUDGET 


(5c) 


G0  >  0 

K .  >  0 

1 


( 5d ) 
(5e) 


As  in  section  3,  we  use  the  miss  ratio  function 
Fi  =  aiKi  '  2, . . . ,n-l 

IER  is  the  minimum  required  instruction  execution  rate  for  the  system 


consisting  of  the  CPU  and  the  storage  hierarchy.  Also, 


Pn 


d.  =  Z  dn.  +  d.  K 
0  £_0  Oi  In  n 


As  before,  we  can  solve  the  subproblem  ignoring 


the  cost  constraint.  Assuming  that  c(^  =  c(,  =  |B  for  i=l,  2, . .  .  ,  n-1 , 

it  can  be  shown  that  the  CPU  delay  tg  must  satisfy  the  equation  [11]: 


where 


_  «  «VP> 

P  (c(+p)c(0 


This  is  a  nonlinear  equation  with  one  unknown  and  can  be  solved  using 
the  standard  iterative  techniques  to  obtain  the  optimal  value  of  tQ. 
Now  the  optimal  values  of  the  memory  capacity  (i=l , 2 , . . . , n-1)  can 
be  obtained  from  equation  (4)  after  substituting 


1  (-i- 

a  'ier 


-  Ati  -  V 


for  the  term  Tg  -  t^. 


As  an  example,  assume  we  want  to  design  a  system  which  is 
capable  of  executing  at  0.2  MIPS.  Other  input  parameters  are  shown  in 
Table  2.  The  parameter  values  are  based  on  data  reported  in  [2,5,8]. 
The  optimal  solution  is  also  shown  in  Table  2.  The  design  roughly 
corresponds  to  a  typical  PDP-11/60  configuration  [2] . 


Next,  we  perform  sensitivity  analysis  varying  the  specified 
value  of  the  IER.  The  system  MTTF  and  the  system  cost  are  plotted  as 
functions  of  IER  in  Figures  4  and  5,  respectively. 


5 .  Concluding  Remarks 

We  have  developed  a  reliability-oriented  design  model  for 
linear  storage  hierarchies.  The  decision  variables  are  the  capacities 
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of  each  memory  level  and  the  speed  of  the  CPU.  The  design  problem  is 
set  up  as  a  geometric  programming  problem  with  the  objective  of  maxim¬ 
izing  system  reliability  subject  to  a  cost  and  a  performance  con¬ 
straint.  Any  relative  optimum  is  a  globally  optimum  solution  to  the 
design  problem.  Closed  form  expressions  for  the  device  capacaties  are 
obtained.  Examples  illustrating  the  usefulness  of  this  model  are  also 
presented . 
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Table  1  :  Example  of  a  Linear  Storage  Hierarchy  Design 

INPUT  PARAMETERS  : 

Number  of  Levels,  n  =  3 

Desired  Hierarchy  Access  Time,  T^  =  2.41  psec 

Address  Space  Size,  Kn  =  500K  words 

Miss  Ratio  Function,  F.  =  a.K.^ 

l  ii 

where  ,  a^  =  576 

,  a2  =  7056 

,  c(  =  1.449 


Memory  Level 

Access  Time 
per  word 

Block  Size 
(words) 

t . 
l 

.  its!_ 

d.  . 
li 

per  10^hr/word 

Cli 

$/word 

1  (cache) 

|  0.1  ps 

1  32 

|  0.1 

|  0.066 

|  1.60 

2  (main  mem) 

1  1  ps 

[  128  | 

1  32  | 

[  0.004625  | 

|  0.160 

3  (back  up  ) 

(  memory  ) 

10  ps 

- 

,  1280 

- 

0.016 

dQ  =  0  ,  CQi  =  0  ,  dQi  =  0  ,  p.  =  1  ,  Yi  =  1  for  all  i  . 


OUTPUT  (OPTIMAL  DESIGN) 


Memory  Level 


Optimal 

Contribution 

Contribution 

Capacity 

to  Access 

to  Failure 

K .  (words) 

Time 

Rate  6 

1 

(ps) 

per  10  hr 

1.2K 


0.1 


79.2 


Contribution 
to  cost 

(?) 

1920 


2 

|  44.35K  | 

0.641  | 

|  204.7  | 

7080 

3 

|  500K  j 

1.669  | 

1 

*000 

Totals 


2.410 


283.9 


17,000 


Table  2  : 


Example  Design  of  a  CPU  and  its  Matching  Storage  Hierarchy 

INPUT  PARAMETERS  : 

Number  of  levels,  n  =  3 

Desired  Instruction  Execution  Rate,  IER  =  0.2  MIPS 

Address  Space  Size,  K  =  500  K  words 

n 

Miss  Ratio  Function,  F.  =  a . kT^ 

1  IX 

where  ,  a1  =  576 

,  a 2  =  7056 

,  c(  =  1.449 

Number  of  Storage  References  per  Instruction,  A  =  1.162 
B0,0  =  0  '  =  1  /  B1  0  =  2,298  x  gate-psec 

dQ0  =  0  ,  =  1  ,  d1Q  =  0.00988  failures  per  gate  per  106  hours 

c00  =  $20'000  »  Y0  =  1  *  c10  =  0.61  $  per  gate 

All  other  parameters  are  equal  to  their  corresponding 
values  in  Table  1. 

OUTPUT  (OPTIMAL  DESIGN) 


Memory  Level 

Delay, t. 

(ps) 

Complexity 

Contribution 
to  Failure 
Rate 

per  10  hr 

Contribution 
to  Cose 

($) 

0 

(CPU) 

1.915 

11971 

Gate 

Count, 

G0 

118 

27,300 

1 

(cache) 

0.1 

1.11K 

72 

1780 

2 

(main  mem) 

32 

41. 4K 

Capac . 

191 

‘  ■u  "  "  "■  i 

6620 

3 

(backup 

memory) 

1280 

500K 

(words) 

8000 

.... 

Totals 

381  | 

43,700 
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FIGURE  3.  Hieran 
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System  Cost  vs.  Instruction  Execution  Rate 
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